Machine learning interpretability for a stress scenario generation in credit scoring based on counterfactuals
نویسندگان
چکیده
To boost the application of machine learning (ML) techniques for credit scoring models, blackbox problem should be addressed. The primary aim this paper is to propose a measure based on counterfactuals evaluate interpretability ML technique. Counterfactuals assist with understanding model regard classification decision boundaries and robustness. second contribution development data perturbation technique generate stress scenario. We apply these two proposals dataset UK unsecured personal loans compare logistic regression stochastic gradient boosting (SBG). show that training (SGB) as conditioned our can provide insight into performance under stressed scenarios. empirical results able capture boundary, unlike AUC accuracy widely used in banking sector.
منابع مشابه
Using DEA for Classification in Credit Scoring
Credit scoring is a kind of binary classification problem that contains important information for manager to make a decision in particularly in banking authorities. Obtained scores provide a practical credit decision for a loan officer to classify clients to reject or accept for payment loan. For this sake, in this paper a data envelopment analysis- discriminant analysis (DEA-DA) approach is us...
متن کاملInverse Classification for Comparison-based Interpretability in Machine Learning
In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself, nor on the processed data (neither the training nor the test data). It proposes an instance-based approach whose principle consists in determining the minimal changes needed to alter a p...
متن کاملOrthogonal support vector machine for credit scoring
The most commonly used techniques for credit scoring is logistic regression, and more recent research has proposed that the support vector machine is a more effective method. However, both logistic regression and support vector machine suffers from curse of dimension. In this paper, we introduce a new way to address this problem which is defined as orthogonal dimension reduction. We discuss the...
متن کاملMachine Learning Model Interpretability for Precision Medicine
Interpretability of machine learning models is critical for data-driven precision medicine efforts. However, highly predictive models are generally complex and are difficult to interpret. Here using Model-Agnostic Explanations algorithm, we show that complex models such as random forest can be made interpretable. Using MIMIC-II dataset, we successfully predicted ICU mortality with 80% balanced ...
متن کاملA hybrid model based on machine learning and genetic algorithm for detecting fraud in financial statements
Financial statement fraud has increasingly become a serious problem for business, government, and investors. In fact, this threatens the reliability of capital markets, corporate heads, and even the audit profession. Auditors in particular face their apparent inability to detect large-scale fraud, and there are various ways to identify this problem. In order to identify this problem, the majori...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2022
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2022.117271